Microcheckpointing with service processor

ABSTRACT

A method, computer program product, and computer system to maintain high availability of a service processor. An embodiment provides program code with a location of a second service processor (the second service processor is communicatively coupled to the first service processor). The program code stops a virtual machine during runtime, including instruction execution and IO operations, where during runtime, the virtual machine executes one or more processes to service and manage computing resources in the distributed computing environment. The program code generates a micro-checkpoint of the virtual machine. The program code resumes the instruction execution of the virtual machine and transmits the micro-checkpoint to a second service processor based on the location and then resumes IO operations. The second service processor utilizes the micro-checkpoint to enable a hypervisor on the second service processor to start a virtual machine on the second service processor.

BACKGROUND

Checkpoints can be used to preserve the state of a server in detail,enabling users, service personnel, or other servers to capture asnapshot of what the server was doing at that instant of time. In theevent of server failure, the preserved state can be used to resumeserver operations from the point in time of the last checkpoint on analternate or backup server. As the server continues to run, newcheckpoints are captured to help minimize the loss of processed data. Ifthese new snapshots can be captured sufficiently fast, applicationsrunning on the server may continue to run unimpaired without any loss ofdata, of calculated results, or of transactions, in spite of thetransition or failover to a backup server. Service processors areservers in their own right, but with the specialized purpose ofsupporting other servers. Service processors often need to rapidlyrespond to conditions that surface in the servers they are supporting.This rapid response time requirement has made it difficult to designservice processors, where in the event of failure of the primary, thebackup can transparently take over. Checkpoint designs for serverprocessors often require sophisticated techniques to preserver the stateof individual processes, state changes, and data changes, which can behighly complex and error prone to craft and maintain.

SUMMARY

Shortcomings of the prior art are overcome and additional advantages areprovided through the provision of a method for providing highavailability on a service processor in a distributed computingenvironment. The method includes, for instance: obtaining, by aprocessing circuit in a first service processor in a distributedcomputing environment, a location of a second service processor, wherethe second service processor is communicatively coupled to the firstservice processor; stopping, by the processing circuit, a virtualmachine during runtime, where the virtual machine is executed by ahypervisor running on the first service processor, and where duringruntime, the virtual machine executes one or more processes to serviceand manage computing resources in the distributed computing environment,where the stopping comprises stopping instruction execution and stoppingIO operations; generating, by the processing circuit, a micro-checkpointof the virtual machine by invoking a code path of the one or moreprocesses to identify and copy dirty memory into a local staging area onthe first service processor; resuming, by processing circuit,instruction execution by the virtual machine; and transmitting, byprocessing circuit, the micro-checkpoint to a second service processorbased on the location, where a processing circuit of the second serviceprocessor utilizes the micro-checkpoint to enable a hypervisor on thesecond service processor to start a virtual machine on the secondservice processor; and resuming, by processing circuit, IO operations.

Methods, computer program products and systems relating to one or moreaspects are also described and claimed herein. Further, servicesrelating to one or more aspects are also described and may be claimedherein.

Additional features and advantages are realized through the techniquesdescribed herein. Other embodiments and aspects are described in detailherein and are considered a part of the claimed aspects.

BRIEF DESCRIPTION OF THE DRAWINGS

One or more aspects are particularly pointed out and distinctly claimedas examples in the claims at the conclusion of the specification. Theforegoing and objects, features, and advantages of one or more aspectsare apparent from the following detailed description taken inconjunction with the accompanying drawings in which:

FIG. 1 is an example of a technical environments that includes certainaspects of an embodiment of the present invention;

FIG. 2 depicts a workflow illustrating certain aspects of an embodimentof the present invention;

FIG. 3 illustrates this aspect of the present invention with respect toa page table in a service processor;

FIG. 4 is an example of a technical environments that includes certainaspects of an embodiment of the present invention;

FIG. 5 depicts a workflow illustrating certain aspects of an embodimentof the present invention;

FIG. 6 is a timing chart illustrating certain aspects of an embodimentof the present invention;

FIG. 7 depicts one embodiment of a cloud computing node;

FIG. 8 depicts one embodiment of a cloud computing environment; and

FIG. 9 depicts one example of abstraction model layers.

DETAILED DESCRIPTION

The accompanying figures, in which like reference numerals refer toidentical or functionally similar elements throughout the separate viewsand which are incorporated in and form a part of the specification,further illustrate the present invention and, together with the detaileddescription of the invention, serve to explain the principles of thepresent invention. As understood by one of skill in the art, theaccompanying figures are provided for ease of understanding andillustrate aspects of certain embodiments of the present invention. Theinvention is not limited to the embodiments depicted in the figures.

As understood by one of skill in the art, program code, as referred tothroughout this application, includes both software and hardware. Forexample, program code in certain embodiments of the present inventionincludes fixed function hardware, while other embodiments utilized asoftware-based implementation of the functionality described. Certainembodiments combine both types of program code. One example of programcode, also referred to as one or more programs, is depicted in FIG. 7 asprogram/utility 40, having a set (at least one) of program modules 42,may be stored in memory 28.

In traditional computing environments, a service processor does nothandle continuous redundancy for either initialization of the system orof independent services provided by the service processor, including butnot limited to, Advanced System Management (ASM or ASMi) and InitialProgram Load (IPL). This is because checkpoint designs for serverprocessors need to operate at a very high speed which often requiressophisticated techniques to preserve the state of individual processes,state changes, and data changes, which can be highly complex and errorprone to craft and maintain. Thus, embodiments of the present inventionprovide fault tolerance for a service processor such that the serviceprocessor maintains continuous high availability redundancy for any ofthe functions of the service processor, including but not limited to,using the information from connections with remotes servers and serviceprocessor configurations to connect to, and manage, Internet SmallComputer Systems Interface (iSCSI) attached integrated servers,identifying initiator systems utilizing information stored in the remotesystem configuration and the service processor configuration objects,and/or managing and/or communicating with a local area network (LAN)adapter, configuring connections utilizing network server configurationobjects. Embodiments of the present invention enable this improvementthat is inextricably tied to computer technology by moving functionalitythat is generally performed directly by a service processor, examples ofwhich were previously enumerated, in an embodiment of the presentinvention, to a virtual machine (VM), and checkpointing the serviceprocessor during runtime. As understood by one of skill in the art,checkpointing consists of saving snapshots of execution states, so thatone or more programs can restart from that point, in case of failure.Micro-checkpointing is checkpointing at a frequency high enough toprevent loss of application level data or transactions in the event offailover for both single and multi-threaded application environments.

In a further embodiment of the present invention, program code performsthe described checkpointing without utilizing a virtualized environment.Embodiments of the present invention detect and checkpoint the memory inthe service processor that has changed since the last checkpoint (i.e.,the dirty memory). “Dirty memory” is file-backed memory in which thecontents have been modified but not yet written back to disk. Thein-memory version of data is out of sync with the on-disk version and issaid to be dirty. A mechanism called writeback eventually updates theon-disk version, bringing the two in sync.

Memory changes included in the described checkpointing may includememory changes from the last checkpoint for all the VMs running on agiven service processor, as well as the hypervisor memory changesitself. In an embodiment of the present invention, program codeperforming the aspects described, rather than being part of a virtualenvironment, is embodied in an Operating System running on hardware ofthe computing environment. The Operating System may be a known OperatingSystem, including but not limited to, Linux, and/or customized or legacycode/firmware. As understood by one of skill in the art, in anembodiment of the present invention, the program code utilizes existingcode paths to access the dirty memory to create checkpoints that can beutilized to restore the state of a service processor for purposes ofproviding high availability (HA), i.e., continuous operation for adesirably long length of time, to users.

Embodiments of the present invention differ from past attempts atproviding fault tolerance for service processors (as well as standaloneservers) in computing systems. Embodiments of the present inventionutilize micro-checkpointing in service processors. Earlier solutionsthat involve checkpointing, but not service processors, have providedredundancy only by utilizing more than one server to execute the samecommands, which creates inefficiencies and potential for errors with thecomputing system. For example, existing systems are designed withhardware lockstep, often with double or triple redundancy processors,with voting result resolution when one of the processors producesdifferent results. Certain other existing systems differ and are limitedinsofar as where they would be implemented. For example, one existingapproach is limited to running on a single core to provide a continuousavailability high availability implementation, but the existing approachutilizes a record then replay type of checkpointing, where theapplication environment is copied to another server and run slightlydelayed from the primary server application. The checkpoints are done bycapturing IO operations from the primary server and then rerun on thebackup server. The results are checked to make sure they are the same.

Existing systems address memory state micro-checkpointing standaloneLinux servers, to provide a continuous availability high availabilitystructure for applications running on x86 servers. However, thesesystems are not adaptable to provide high availability for serviceprocessors, (i.e., servers that control other servers and/or controllinghypervisor environments running client application environments).Standalone Linux servers running client applications differ from serviceprocessors, which are used to initialize, manage, monitor and controlstandalone servers running client application environments. Serviceprocessors monitor the physical state of the standalone server byconnecting to sensors within the standalone server. The serviceprocessor can detect and diagnose abnormal conditions and then attemptto log information about the failure for use by a system administratoror field support engineer in problem determination and resolution. Theservice processor may also attempt to automatically recover from thefailure. Service processors are also used to interface with and controlhypervisors running on the standalone servers.

Unlike embodiments of the present invention, existing methods ofproviding (limited) high availability and redundancy for serviceprocessors do not support the hotswap (i.e., replacement while thecomputer system remains in operation) of service processors. Currently,high availability for service processors is severely limited andresource intensive to maintain because every critical high availabilityfunction/structure has to be individually designed and hand coded. Thesecustom, hand coded processes are highly customized because of all thesystems that a service processor is responsible for managing andmonitoring. In order to create a process that accurately captures theoperation of a service processor, a developer would require, forexample, a detailed and through understanding of how each sub-functionworks and its interaction, role and dependency on other sub-function.Thus new and complex interaction problems can surface when code ischanged or new functions are added to service processor. Embodiments ofthe present invention eliminate the resource intensive work necessary inthis current design to maintain critical high availabilityfunction/structure because many of these functions no longer depend onindividually designing and hand coding. Instead, as described in detailherein, the memory state changes are captured in a checkpoint. Themicro-checkpointing utilized in embodiments of the present inventionensures that after failover, a backup service processor will be able tocontinue to operate without issue, including in link error scenarios andcorner cases, which are arguably the most complex and problematic in thecurrent design. The micro-checkpointing of the service processor, inembodiments of the present invention, enables support of serviceprocessor hot swap repair, which is not currently supported, due to thecomplexity.

In order to provide high availability, in an embodiment of the presentinvention, the program code executed by a processing circuit in a firstservice processor in a distributed computing environment obtains alocation of a second service processor, where the second serviceprocessor is communicatively coupled to the first service processor.This program code then stops one or more programs executing on theprocessing circuit. These one or more programs are executed on the firstservice processor and during runtime, the one or more programs executeone or more processes to service and manage computing resources in thedistributed computing environment. The program code generates amicro-checkpoint by invoking a code path of the one or more programs toidentify and copy dirty memory into a local staging area on the firstservice processor. The program code then resumes the one or moreprograms. The program code transmits the micro-checkpoint to a secondservice processor, based on the location, where a processing circuit ofthe second service processor utilizes the micro-checkpoint to enable thesecond service processor to start the one or more processes on thesecond service processor.

An advantage provided by certain embodiments of the present invention isthat by moving functionality to a VM or other program executing on oneor more processors of a service processor, during checkpointing, theservice processor will not run into timing conflicts, which causeredundancy to fail. Another advantage of certain embodiments of thepresent invention is a decrease in issues caused by bugs in program codeexecuted by the service processor, because aspects of the implementationof the described functionality in the VM, or other program executing onone or more processors of a service processor decreases, interfaceinteractions with dependent components. Another advantage provided byembodiments of the present invention is that aspects of the describedimplementation of functionality in a VM or another program, decrease thecomplexity of the original synching design utilized when thefunctionality remains in the service processor because both the VMand/or other program can utilize existing code paths and the dirtymemory. Thus, due to the fact that synching no longer needs to occur ona per-process basis within the service processor, but instead on awhole-VM and/or whole program (including all programs included in thecheckpoint) basis, processes are simplified and leave room for fewerunintended errors.

As will be discussed herein, embodiments of the present inventionprovide and apply a memory state micro-checkpoint structure to serviceprocessors to achieve high availability. Service processors have uniquerequirements within computing environments. For example, serviceprocessors service and manage servers, including application servers, ina distributed computing environment, which includes servers that handlethe hypervisor (virtual machine monitor) functions of the hypervisorsthat reside on the servers. For example, such as environments servicedand managed by a service processor, e.g., a flexible service processor(FSP), may include POWER Servers that handle critical Power Hypervisor(PHYP) functions residing on the servers. The aforementioned POWERServers may utilize an error recovery PSI (Processor Support Interface)link, which is a unique interface between the service processor and PHYPrunning on the POWER application server. The FSP in this environment mayalso include a NAND (negative-AND) flash structure that holdsnonvolatile data to capture and handle state changes as part of themicro-checkpoint system of an embodiment of the present invention.Unlike in the aforementioned existing systems to service processorredundancy, embodiments of the present invention do not include a needto design, individually and/or hand code, each critical HAfunction/structure because memory state changes are captured in thecheckpoints, which handle this functionality.

Certain environments into which aspects of certain embodiments of thepresent invention are implemented include computing nodes that areconfigured based on the PowerPC architecture utilized by theaforementioned POWER Servers, also referred to as Power ISA, offered byInternational Business Machines Corporation (IBM®). Elements of theenvironments, as described in Power ISA™ Version 2.07, May 3, 2013, ishereby incorporated by reference herein in its entirety. These technicalenvironments into which embodiments of the present invention areimplemented may include one or more aspects, as well as computingenvironments of other architectures, such as the z/Architecture, offeredby International Business Machines Corporation, and described inz/Architecture—Principles of Operation, Publication No. SA22-7832-09,10th Edition, September 2012, which is hereby incorporated by referenceherein in its entirety. POWER, POWER ARCHITECTURE, POWERPC,Z/ARCHITECTURE, IBM, AIX, POWERVM, Z/OS and Z/VM (referenced herein) areregistered trademarks of International Business Machines Corporation,Armonk, N.Y. Other names used herein may be registered trademarks,trademarks or product names of International Business MachinesCorporation or other companies.

In an embodiment of the present invention, by providing a memory statemicro-checkpoint structure to service processors to achieve highavailability, after a failover, a backup service processor (e.g., backupFSP) can continue operations (e.g., servicing and managing servers inthe distributed computing environment) without experiencing any serviceinterruptions. For example, returning to the example environment,failing over to a backup would not cause the system to experience FSPand link error scenarios and corner cases, which are complex andproblematic issues in existing distributed computing system designs.Thus, the simplification of micro-checkpoints in embodiments of thepresent invention enables system continuity, even if an FSP hot swaprepair is required. Present checkpointing designs, which do not utilizea high level memory structure, do not support this functionality becauseof the potential complexities created by a failover that are avoided bythe centralization of the functionality of a service processor in aprogram executing on the service processor that can utilize existingcode paths to access dirty memory, including, but not limited to, asdescribed in certain embodiments, a VM, and the checkpointing of theprogram, which can be a VM, in embodiments of the present invention.

Embodiments of the present invention provide advantages over knownmethods of checkpointing service processors. For example, the describedimplementation of embodiments of the present invention maintain highavailability for service processor space and this high availability isprovided on a live server. Additionally, embodiments of the presentinvention provide new functions for a service processor, includingcommunication and handshakes, for example, between a primary serviceprocessor and a backup service processor, to be used in the event of afailure of the primary service processor. Embodiments of the presentinvention also provide interdependent locks between two or more physicalservice processors, e.g., the aforementioned primary and backup serviceprocessors.

The use of redundancy is illustrated in FIG. 1, which includes a SourceService Processor 110, a primary service processor, and a DestinationService Processor 120, a backup or secondary service processor. As willbe discussed, the embodiment of FIG. 1, as well as the workflow 200 thatis FIG. 2, discuss how aspects of the present invention can beimplemented in a technical environment 100 (FIG. 1) that includes atleast one hypervisor running at least one VM. However, FIGS. 4-5 provideexamples of embodiments of the present invention that do not include avirtualized environment. These different embodiments of the presentinvention are offered as illustrative examples and not to limit possibleimplementations of the functionality described into certain computingsystems.

To establish the service processor redundancy, as illustrated in bothFIGS. 1 and 4, in embodiments of the present invention, program codeexecuted by a first service processor monitors a sibling serviceprocessor, in order to maintain synchronization to achieve substantiallyuninterrupted service processor functionality during failover andrecovery to the sibling service processor. This interdependency providesan additional advantage as program code needs only calculate one activestate and preserve memory state with no extra calculations related tothe secondary (sibling) processor because the processors aresynchronized. Embodiments of the present invention also preserve thestate of a service processor, for recovery by a secondary serviceprocessor, because preservation of the memory state is substantiallysufficient, including volatile memory, real time clock, CentralElectronics Complex (CEC) hardware content, system model and machinestate.

In an embodiment of the present invention, the program code can detect,capture and communicate processes to a dependent back up processor(e.g., Destination Service Processor 120, FIG. 1, Destination ServiceProcessor 420, FIG. 4) to be used for substantially seamless serviceprocessor recovery. In an embodiment of the present invention, the twoservice processors are synchronized outside of IO. Embodiments of thepresent invention also enable the service processor to manage registryfiles because the synchronization aspect of the present invention (e.g.,the utilization of the two service processors) manages and seamlesslyrecovers service processor registry files, VPD (vital product data) andthe HMC (hardware maintenance console).

Checkpointing in embodiments of the present invention is alsoindependent of code bugs resulting from new code functions being addedor changed. Rather, in embodiments of the present invention, programcode monitors dirty pages (pages whose contents have changed since thelast checkpoint), locating dirty memory 118 128, to capture serviceprocessor memory state and NAND flash state. In embodiments of thepresent invention, by micro-checking the service processor, the programcode removes duplicated component functions.

Embodiments of the present invention interact with non-volatile storage.For example, program code updates the aforementioned NAND flash driveoutside the standard Operating System and VIVI input/output processes(e.g., Linux/KVIVI IO). (The NAND flash drive may utilize a PortableOperating System Interface (POSIX) with a wrapper and a check file to besynchronized, called an Application Programming Interface (API) andwrite a file to a local FSP.)

As aforementioned, FIG. 1 depicts a technical environment 100 thatincludes virtualization, in which aspects of embodiments of the presentinvention may be implemented. Embodiments of the present inventionprovide fault tolerance for a service processor in order to havecontinuous high availability redundancy for any of the functions withina system (not pictured) serviced and managed by the service processor.In order to maintain continuous high availability, embodiments of thepresent invention utilize redundant service processors, for example, theSource Service Processor 110 and the Destination Service Processor 120,illustrated in FIG. 1. Although not illustrated in FIG. 1, technicalenvironments that utilize aspects of an embodiment of the presentinvention may include unique hardware outside of checkpointing thatsynchronizes the Service Processor 110 and the Destination ServiceProcessor 120.

Referring to FIG. 1, in an embodiment of the present invention, programcode executed by at least one processing circuit in the Source ServiceProcessor 110 and the Destination Service Processor 120 loadinstructions for loading and running an Operating System, e.g., bootcode. The Source Service Processor 110 and the Destination ServiceProcessor may execute program code that includes these instructions froma hard drive, a memory, or directly from a processing unit, which may beintegrated into a silicon wafer of an integrated circuit. In anembodiment of the present invention, the program code executes byprocessing circuits in the Source Service Processor 110 and theDestination Service Processor 120 include, for each processor, a kernel115 125 of an Operating System executed on the Source Service Processor110 and the Destination Service Processor 120. In an embodiment of thepresent invention, the Source Service Processor 110 and the DestinationService Processor 120 execute instructions that start a kernel 115 125that starts and runs an Operating System, including but not limited to,a native Linux Operating System.

In an embodiment of the present invention, the Operating System of theService Processor 110 and the Destination Service Processor 120 each runa host Kernel-based Virtual Machine (KVM/VM) 117 127 that executesinstructions to create a virtual machine 112 122, which runs avirtualized (guest) version of the Operating System 112 122. When theOperating System is Linux, the native Linux runs the host KVM/VM andcreates a virtualized Linux and the virtualized Linux in a standardLinux KVM/VM guest virtual machine. In an embodiment of the presentinvention, the virtualization on the service processors is managed by anOpen Source 155 management software that hooks into Libvirt 132 142 ofeach service processor to manage the virtualization environment runningon these servers. Libvirt 132 142 is a toolkit that interacts with thevirtualization capabilities various Operating Systems, including but notlimited to, Linux.

Certain elements of the technical environment 100 of FIG. 1 may bespecific to embodiments of the present invention that utilize POWERServers. As seen in FIG. 1, the Source Service Processor 110 and theDestination Service Processor 120 include Translation Control Entries(TCE) 131 141, which isolate IO access in memory. TCEs are used for theI/O address to memory address translation in order to perform directmemory access (DMA) transfers between memory and Peripheral ComponentInterconnect (PCI) adapters. This embodiment also includes Remote DirectMemory Access (RDMA) Adaptors 116 126 and may also utilize a CoherentAttached Processor Interface (CAPI) to speed up the transfer of thecheckpoints via more direct memory to memory transfer operation, forexample, from local storage 135, to mirrored local storage 145.

In an embodiment of the present invention, one or more processingcircuits in the Source Service Processor 110 and the Destination ServiceProcessor 120 run program code (e.g., functions) inside the virtualmachines 112 122 that, previous to the implementation of the describedaspects of an embodiment of the present invention, would run on aservice processor. For example, in an embodiment of the presentinvention where the Source Service Processor 110 is an FSP, whichconnects the managed system to the Hardware Management Console, (HMC)the functionality that was formerly provided by the firmware of the FSPon the FSP, but is executed by program code of the virtual machine 112provides diagnostics, initialization, configuration, run-time errordetection and correction. In order for the virtual machines 112 122 toexecute functionality formerly executed directly by the Source ServiceProcessor 110 and/or the Destination Service Processor 120, in anembodiment of the present invention, the KVM/VMs 117 127 execute anemulator, for example a Quick Emulator (QEMU) 114 124 component. In anembodiment of the present invention, the virtual machine 112 122executes applications 133 143 on a guest operating system 113 123.

In an embodiment of the present invention, once the Source ServiceProcessor 110 and the Destination Service Processor 120 have executedinstructions to initialize an Operating System, one or more processingcircuits of these processors initializes program code to checkpoint theSource Service Processor 110. After the virtual machine 112 starts(boots), program code that includes checkpointing, underneath thevirtual machine 112 and inside an emulator 114 run on the KVM/VM 117,including but not limited to a QEMU, obtains a parameter of the locationof the secondary service processor, the Destination Service Processor120. In an embodiment of the present invention, the program code obtainsthis parameter either through the external Ethernet link available onall service processors, or by modifying the existing checkpointingprotocol to use some other internal link to communicate between serviceprocessors. In FIG. 1, connections 130 between the Source ServiceProcessor 110 and the Destination Service Processor 120 are illustratedand in one example, the program code may obtain the parameter over theillustrated connections 130 between the Source Service Processor 110 andthe Destination Service Processor 120. In an embodiment of the presentinvention, the micro-checkpointing 140 is executed by program codedirectly on the Source Service Processor 110 and performs themicro-checkpointing operations on the functionality that is executed inthe virtual machine 112.

Referring to FIG. 1, by coupling Source Service Processor 110 and theDestination Service Processor 120, embodiments of the present inventioninclude a high availability service processor system. Program codeexecuting on one or more processing circuits of the service processorsutilizes processor and memory state checkpointing. To enable this highavailability, program code synchronizes the service processors, theSource Service Processor 110 and the Destination Service Processor 120,which allows for continuity when a service processor fails. Duringoperation, the Source Service Processor 110 and the Destination ServiceProcessor 120 may communicate utilizing handshaking mechanisms. Toenable a failover from one service processor to the other, embodimentsof the present invention utilize interdependent locks between the twophysically separate service processors, Source Service Processor 110 andthe Destination Service Processor 120. Meanwhile, program code sends andreceives a periodic signal (heartbeat) between the Source ServiceProcessor 110 and the Destination Service Processor 120 at a regularinterval (e.g., every few seconds) to synchronize these computing nodes.Exchanging heartbeats enables the program code to monitor theDestination Service Processor 120 to achieve substantially uninterruptedservice processor functionality during failover and recovery to theDestination Service Processor 120. In an embodiment of the presentinvention, program code synchronizing the processors enables themanagement and seamless recovery of service processor registry files,VPD (vital product data) and the HMC. In an embodiment of the presentinvention, the described monitoring could be accomplished separatehardware (e.g., a watch dog timer), instead of utilizing a DestinationService Processor 120 to ping the Source Service Processor 110. However,this latter approach would require additional specialized hardware,which can be advantageously avoided in certain embodiments of thepresent invention.

As understood by one of skill in the art, in an embodiment of thepresent invention, one or more programs (e.g., hardware) control whichservice processor executes work and also when to fail over a givenservice processor (e.g., Source Service Processor 110) to a backup(e.g., Destination Service Processor 120). The one or more programs stopthe initial service processor from executing any work briefly while oneor more programs copy the dirty memory to buffer memory in the initialservice processor. When the one or more programs complete copying thedirty memory, the work can resume, but not IO, which remain halted untilthe buffer in the primary is sent to the back up. For example, in FIG.1, the program code, at the time of a checkpoint, sends the dirtymemory, at the time of a checkpoint, through the Source RDMA adapter 116(i.e., the primary service processor) to Destination RDMA Adaptor 126(i.e., the backup service processor). During this time, the SourceService Processor 110 can continue dirtying memory because the snap shotcaptured from the dirty memory since the snap shot was captured in thebuffer, but the Source Service Processor 110 cannot execute IO. Thus,one or more programs control the stopping of the Source ServiceProcessor 110, allow the Source Service Processor 110 to continue oncethe dirty memory is transferred to the Destination Service Processor120. FIG. 6 is a timing chart that illustrates this type of continuitywhere one or more programs preserve as much of the normal operation of aservice processor as possible, at all times.

FIG. 2 is a workflow 200 that illustrates program code executing thecheckpointing of a service processor in accordance with aspects of anembodiment of the present invention. Because, as illustrated in FIG. 1,in embodiments of the present invention program code executesfunctionality in a virtual machine, rather than directly on the serviceprocessor, in order to checkpoint the processes executed by the serviceprocessor, the program code micro-checkpoints the virtual machineexecuting the processes. As understood by one of skill in the art,micro-checkpointing is a method for providing fault tolerance to arunning virtual machine with little or no runtime assistance from theguest kernel or guest application software.

As aforementioned, in an embodiment of the present invention, theprogram code starts initializing micro-checkpointing code (e.g.,underneath the VM inside QEMU) by obtaining the location of thesecondary service processor (e.g., as a parameter) (210). The programcode runs the micro-checkpointing code and stops the VM in the primaryservice processor from executing instructions and IO operations after aspecified or dynamic amount of time (e.g., a specified or dynamic numberof milliseconds) (220). The program code then generates amicro-checkpoint by invoking a code path of the program code to identifyand copy dirty memory into a local staging area (230). For example, inan embodiment of the present invention, the one or more programsgenerate the micro-checkpoint by temporary halting operations in theprimary service processor and utilizes a live partition migrationfunction of the hypervisor to identify and copy dirty memory (i.e.,memory locations that have changed since the last checkpoint) to thelocal staging area. An example of a local staging area on Source ServiceProcessor 110 is Local Storage 135. In an embodiment of the presentinvention, in generating the micro-checkpoint, the program code capturesthe service processor memory state and, in certain embodiments of thepresent invention, NAND flash state, by monitoring dirty pages (i.e.,pages whose contents have changed since the last checkpoint).

As seen in FIG. 1, monitoring the dirty memory 118 of the Source ServiceProcessor 110, for example, includes scanning a the page tables 119 111of the Source Service Processor 110 and adding a bit map to mark when apage fault occurs. FIG. 3 illustrates this aspect of the presentinvention with respect to a page table 319 in a service processor. Theprogram code creates a bit map 362 by scanning a page table 319 fordirty entries. This bit map 362 structure can be utilized by the programcode faster than performing a full scan of the page table 319.

Returning to FIG. 2, in an embodiment of the present invention, theprogram code resumes the instruction execution in the virtual machine ofthe primary service processor (240). By resuming the instructionexecution virtual machine immediately, the program code enables certainfunctions of the service processor to continue, while the aspects of thecheckpointing process can continue in parallel; micro-checkpointing isalso referred to as continuous replication due to the system continuitythat it affords.

The program code transmits the micro-checkpoint to the secondary (i.e.,destination) service processor, updating the memory of the secondaryservice processor (250). In an embodiment of the present invention, thememory in the destination service processor is updated with the changessince the last checkpoint. In an embodiment of the present invention,the program code transmits the micro-checkpoint to the destinationsecondary service processor via a live partition migration function inthe hypervisor. The relationship between the initial service processorand a backup or destination service processor is discussed above withreferences to FIG. 1. As described in reference to FIG. 1, in anembodiment of the present invention, the program code detects, capturesand communicates to a dependent back up processor to be used forsubstantially seamless service processor recovery, nonvolatile storage(e.g., NAND Flash) that is updated outside the standard Linux/KVM IOand, in further embodiments, IO processes outside standard Linux IOprocesses.

Returning to FIG. 2, the program code resume IO operation in the primaryservice processor (260). In an embodiment of the present invention, theprogram code utilizes the live partition migration function of thehypervisor to allow IO operation to resume. The program code resumes IOoperations after the checkpoint information (e.g., a snapshot) intransmitted because if instructions continue (but not IO) and theprimary service processor fails before the snapshot transfer iscomplete, the program code can initiate a fail over and re-execute thoseinstructions on the destination service processor. However if IO wasissued by the primary service processor, during this period, the programcode may not be able to recover. IO operations may include a financialtransaction initiated by logging into the server, or a record update onthe hard drive; those IO operation would be lost.

As seen in FIG. 2, provided that the primary service processor continuesto operate normally, the micro-checkpointing process continues (265)(220). The primary service processor would fail to operate normally inthe event of a technical failure of this processor of if it becomesunresponsive. Failures includes broken hardware, code bugs, or anycondition causes the service processor to stop operating within itsexpected performance specifications and design parameters. However, inthe event that the primary service processor ceases to operate normally(265), the program code initiates a fail over to resume virtual machineoperation on the secondary service processor (270). Because, asillustrated in FIG. 1, because the initial service processor, theprimary service processor, and the backup service processor, thesecondary service processor, are synchronized, the swap of the serviceprocessors can be accomplished with only minor service interruption.

In an embodiment of the present invention, because the functionality ofthe service processor is executed in a virtual machine, rather thandirectly by the service processor, when the program code checkpoints thevirtual machine, the checkpoint provides a full accounting of theinformation necessary to restart this functionality by restarting thevirtual machine with the last available checkpoint. When thisfunctionality is executed directly on the service processor, in order torestore the functionality after the failure, the service processor mustindividually handle each specific service processor function that isrequired to have its state preserved for recovery by a secondary serviceprocessor, since preservation of the memory state alone is substantiallysufficient including volatile memory, real time clock, CEC hardwarecontent, system model and machine state.

In certain embodiments of the present invention, certain functionalityexecuted by a virtual machine that is generally run on the serviceprocessor directly, may or may not be protected. Thus, after a failure,the unprotected functions may need to reconnect with the newly recoveredvirtual machine and establish awareness that the secondary serviceprocessor is not the primary service processor.

As illustrated in FIGS. 1-2, embodiments of the present inventioninclude program code executed by a processing circuit in a first serviceprocessor of a distributed computing environment obtains a location of asecond service processor, where the second service processor iscommunicatively coupled to the first service processor. The program codestops a virtual machine during runtime (e.g., after a specified numberof milliseconds or a dynamic number of milliseconds), where the virtualmachine is executed by a hypervisor running on the first serviceprocessor, and where during runtime, the virtual machine executes one ormore processes to service and manage computing resources in thedistributed computing environment. The program code generates amicro-checkpoint of the virtual machine by invoking a code path of theone or more processes to identify and copy dirty memory into a localstaging area on the first service processor. The program code resumesthe virtual machine and transmits the micro-checkpoint to a secondservice processor based on the location, where program code executed bya processing circuit of the second service processor utilizes themicro-checkpoint to enable a hypervisor on the second service processorto start a virtual machine on the second service processor. In anembodiment of the present invention, the first service processor and thesecond service processor are physically separate service processors.

In an embodiment of the present invention, program code executed by theprocessing circuit of the second service processor utilizes themicro-checkpoint to enable the hypervisor on the second serviceprocessor to start the virtual machine on the second service processor,based on obtaining an indication of a failure of the first serviceprocessor.

In an embodiment of the present invention, the program code executed bythe processing circuit of the first service processor, transmits a firstperiodic signal to the second service processor, at a regular intervaland obtains, from the second service processor, a second periodicsignal, at the regular interval. In an embodiment that includes thisheartbeat functionality, when the program code transmits themicro-checkpoint, it may first verify that the first service processorobtained the second periodic signal from the second service processorduring the regular interval that is within a pre-defined timelimitation. In an embodiment of the present invention, the program codemay synchronize the first service processor to the second serviceprocessor by exchanging registry information with the second serviceprocessor utilizing a handshaking mechanism.

In an embodiment of the present invention, program code executed by theprocessing circuit of the first service processor may obtain anindication that the virtual machine of the second service processor isexecuting the one or more processes to service and manage the computingresources in the distributed computing environment. Based on obtainingthe indication, the program code may enable an interdependent lock onthe first service processor. In an embodiment of the present invention,the program code may also obtain a micro-checkpoint from the secondservice processor, retain the micro-checkpoint, obtain an indication ofa failure of the second service processor, and based on the indication,load the micro-checkpoint into a memory communicatively coupled to theprocessing circuit and utilize the micro-checkpoint to re-start thevirtual machine on the first service processor.

In an embodiment of the present invention, the micro-checkpoint includesa memory state of the first service processor and a state of a structurethat holds nonvolatile data in the first service processor. In anembodiment of the present invention, the transmitting of themicro-checkpoint may also include the program code communicating a stateof input/output processes executed in the virtual machine executing onthe first service processor.

In an embodiment of the present invention, prior to obtaining thelocation, the program code starts the virtual machine on the firstservice processor. To start the virtual machine, the program code loadsinstructions for loading and running an operating system on the firstservice processor, starts a kernel that runs the operating system,executes, by the operating system, instructions to start the hypervisor,and starts, via the hypervisor, the virtual machine on the first serviceprocessor, where the virtual machine runs a virtual operating system.

Turning now to FIG. 4, in an embodiment of the present invention,program code leverages live partition migration code paths tomicro-check a service processor without utilizing a virtualizedenvironment. As seen in this technical environment 400, applications 471481 execute on the Source Service Processor 410 and/or the DestinationService Processor 120 and these applications can change the memory ofthe respective service processors. The memory in the service processorthat has changed since the last checkpoint is the dirty memory 418 428.The applications 471 481 may include VMs running as well as thehypervisor. The applications 471 481 include program code running on theSource Service Processor 410 and/or the Destination Service Processor120, respectively. As will be explained further in FIG. 5, in order togenerate a micro-checkpoint, program code (micro-checkpoints 440) stopsthe program code executing on a service processor (e.g., Source ServiceProcessor 410) and detects and checkpoints the memory in the serviceprocessor that has changed since the last checkpoint (e.g., the dirtymemory 418).

As understood by one of skill in the art, due to the high-frequencynature of micro-checkpointing, program code in an embodiment of thepresent invention generates a new micro-checkpoint many times per secondbecause missing just a few micro-checkpoints could constitutes afailure. Because of the consistent buffering of device I/O, this is safebecause device IO is not committed to the outside world until themicro-checkpoint has been received at the destination.

FIG. 5 is a workflow 500 that illustrates program code executing thecheckpointing of a service processor in accordance with aspects of anembodiment of the present invention. As understood by one of skill inthe art and explained earlier, the program code may include bothhardware and software and, in the context of the embodiment of FIG. 5,includes software that can be used alone or in conjunction withprocessing circuits in hardware. This workflow 500 is illustrated, forease of understanding, with examples from FIG. 4. In an embodiment ofthe present invention, program code executing on a service processor(e.g., Source Service Processor 410) starts initializingmicro-checkpointing code by obtaining the location of the secondaryservice processor (e.g., Destination Service Processor 420) (e.g., as aparameter) in order to start micro-checkpointing of code (e.g., one ormore programs) running on the primary service processor (510).

After a specified or dynamic amount of time, one or more programs stopthe primary service processor from executing instructions and IOoperations (520). The program code then generates a micro-checkpoint byinvoking a code path of the program code to identify and copy dirtymemory into a local staging area (e.g., Local Storage 435) (530). In anembodiment of the present invention, the program code generates themicro-checkpoint by temporary halting operations in the primary serviceprocessor while it identifies and copies dirty memory (i.e., memorylocations that have changed since the last checkpoint) to a localstaging area. In an embodiment of the present invention, in generatingthe micro-checkpoint, the program code captures the service processormemory state and, in certain embodiments of the present invention, NANDflash state, by monitoring the dirty pages.

In an embodiment of the present invention, the program code resumesinstruction execution in the primary service processor (540) so thatcertain of the service processor functions continue while the aspects ofthe checkpointing process continue in parallel. In an embodiment of thepresent invention, the program code allows the primary service processorto resume instruction execution. The program code transmits themicro-checkpoint to the secondary service processor, updating a memoryof the secondary service processor (550). In an embodiment of thepresent invention, upon receipt of the transmission, program codeupdates the memory in the secondary service processor is updated withthe changes since the last checkpoint. In an embodiment of the presentinvention, the program code resumes IO operation in the primary serviceprocessor (560).

Provided that the primary service processor continues to operatenormally, the micro-checkpointing process continues (565) (520). In theevent that the primary service processor ceases to operate normally(565), the program code switches from primary service processor tosecondary service processor and resumes normal operation (570).

In the case where this process fails (555), the program code willgenerate a micro-checkpoint (520) and repeat (557) this describedprocess, as illustrated in FIG. 5. Upon failure of this process, theprogram code loads the contents of the last micro-checkpoint at thedestination back into memory, and runs the one or more programs on thedestination processor (560) (e.g., applications 481). In the event thatthe service processor fails (558), the program code will load thecontents of the last micro-checkpoint at the destination serviceprocessor back into memory and run the one or more programs on thedestination service processor (560). The program code can initiate thevirtual machine on a backup service processor, rather than on theinitial service processor, which experienced the failure. Due to theheartbeat functionality aspect of the present invention, accomplishedbased on the connectivity between the service processors (e.g.,connections 430), the initial service processor and the backup serviceprocessor are synchronized, so the swap of the service processors can beaccomplished with only minor service interruption. If there is nofailure, micro-checkpointing continues.

It is understood in advance that although this disclosure includes adetailed description of cloud computing, implementation of the teachingsrecited herein are not limited to a cloud computing environment. Rather,embodiments of the present invention are capable of being implemented inconjunction with any other type of computing environment now known orlater developed.

Cloud computing is a model of service delivery for enabling convenient,on-demand network access to a shared pool of configurable computingresources (e.g., networks, network bandwidth, servers, processing,memory, storage, applications, virtual machines, and services) that canbe rapidly provisioned and released with minimal management effort orinteraction with a provider of the service. This cloud model may includeat least five characteristics, at least three service models, and atleast four deployment models.

Characteristics are as follows:

On-demand self-service: a cloud consumer can unilaterally provisioncomputing capabilities, such as server time and network storage, asneeded automatically without requiring human interaction with theservice's provider.

Broad network access: capabilities are available over a network andaccessed through standard mechanisms that promote use by heterogeneousthin or thick client platforms (e.g., mobile phones, laptops, and PDAs).

Resource pooling: the provider's computing resources are pooled to servemultiple consumers using a multi-tenant model, with different physicaland virtual resources dynamically assigned and reassigned according todemand. There is a sense of location independence in that the consumergenerally has no control or knowledge over the exact location of theprovided resources but may be able to specify location at a higher levelof abstraction (e.g., country, state, or datacenter).

Rapid elasticity: capabilities can be rapidly and elasticallyprovisioned, in some cases automatically, to quickly scale out andrapidly released to quickly scale in. To the consumer, the capabilitiesavailable for provisioning often appear to be unlimited and can bepurchased in any quantity at any time.

Measured service: cloud systems automatically control and optimizeresource use by leveraging a metering capability at some level ofabstraction appropriate to the type of service (e.g., storage,processing, bandwidth, and active user accounts). Resource usage can bemonitored, controlled, and reported providing transparency for both theprovider and consumer of the utilized service.

Service Models are as follows:

Software as a Service (SaaS): the capability provided to the consumer isto use the provider's applications running on a cloud infrastructure.The applications are accessible from various client devices through athin client interface such as a web browser (e.g., web-based email). Theconsumer does not manage or control the underlying cloud infrastructureincluding network, servers, operating systems, storage, or evenindividual application capabilities, with the possible exception oflimited user-specific application configuration settings.

Platform as a Service (PaaS): the capability provided to the consumer isto deploy onto the cloud infrastructure consumer-created or acquiredapplications created using programming languages and tools supported bythe provider. The consumer does not manage or control the underlyingcloud infrastructure including networks, servers, operating systems, orstorage, but has control over the deployed applications and possiblyapplication hosting environment configurations.

Infrastructure as a Service (IaaS): the capability provided to theconsumer is to provision processing, storage, networks, and otherfundamental computing resources where the consumer is able to deploy andrun arbitrary software, which can include operating systems andapplications. The consumer does not manage or control the underlyingcloud infrastructure but has control over operating systems, storage,deployed applications, and possibly limited control of select networkingcomponents (e.g., host firewalls).

Deployment Models are as follows:

Private cloud: the cloud infrastructure is operated solely for anorganization. It may be managed by the organization or a third party andmay exist on-premises or off-premises.

Community cloud: the cloud infrastructure is shared by severalorganizations and supports a specific community that has shared concerns(e.g., mission, security requirements, policy, and complianceconsiderations). It may be managed by the organizations or a third partyand may exist on-premises or off-premises.

Public cloud: the cloud infrastructure is made available to the generalpublic or a large industry group and is owned by an organization sellingcloud services.

Hybrid cloud: the cloud infrastructure is a composition of two or moreclouds (private, community, or public) that remain unique entities butare bound together by standardized or proprietary technology thatenables data and application portability (e.g., cloud bursting forloadbalancing between clouds).

A cloud computing environment is service oriented with a focus onstatelessness, low coupling, modularity, and semantic interoperability.At the heart of cloud computing is an infrastructure comprising anetwork of interconnected nodes. Aspects of embodiments of the presentinvention, including the checkpointing and the management of adistributed environment with at least two service processors can beutilized to manage a distributed network such as a cloud computingsystem.

Referring now to FIG. 7, a schematic of an example of a cloud computingnode is shown. Cloud computing node 10 is only one example of a suitablecloud computing node and is not intended to suggest any limitation as tothe scope of use or functionality of embodiments of the inventiondescribed herein. Regardless, cloud computing node 10 is capable ofbeing implemented and/or performing any of the functionality set forthhereinabove. In an embodiment of the present invention, the one or moreprocessors utilized integrated into and/or utilized by program code inthe display device, and computing nodes from which the program codeobtains product models can be understood as a cloud computing node 10(FIG. 7) and if not a cloud computing node 10, then a general computingnode that includes aspects of the cloud computing node 10.

In cloud computing node 10 there is a computer system/server 12, whichis operational with numerous other general purpose or special purposecomputing system environments or configurations. Examples of well-knowncomputing systems, environments, and/or configurations that may besuitable for use with computer system/server 12 include, but are notlimited to, personal computer systems, server computer systems, thinclients, thick clients, handheld or laptop devices, multiprocessorsystems, microprocessor-based systems, set top boxes, programmableconsumer electronics, network PCs, minicomputer systems, mainframecomputer systems, and distributed cloud computing environments thatinclude any of the above systems or devices, and the like.

Computer system/server 12 may be described in the general context ofcomputer system-executable instructions, such as program modules, beingexecuted by a computer system. Generally, program modules may includeroutines, programs, objects, components, logic, data structures, and soon that perform particular tasks or implement particular abstract datatypes. Computer system/server 12 may be practiced in distributed cloudcomputing environments where tasks are performed by remote processingdevices that are linked through a communications network. In adistributed cloud computing environment, program modules may be locatedin both local and remote computer system storage media including memorystorage devices.

As shown in FIG. 7, computer system/server 12 in cloud computing node 10is shown in the form of a general-purpose computing device. Thecomponents of computer system/server 12 may include, but are not limitedto, one or more processors or processing units 16, a system memory 28,and a bus 18 that couples various system components including systemmemory 28 to processor 16.

Bus 18 represents one or more of any of several types of bus structures,including a memory bus or memory controller, a peripheral bus, anaccelerated graphics port, and a processor or local bus using any of avariety of bus architectures. By way of example, and not limitation,such architectures include Industry Standard Architecture (ISA) bus,Micro Channel Architecture (MCA) bus, Enhanced ISA (EISA) bus, VideoElectronics Standards Association (VESA) local bus, and PeripheralComponent Interconnect (PCI) bus.

Computer system/server 12 typically includes a variety of computersystem readable media. Such media may be any available media that isaccessible by computer system/server 12, and it includes both volatileand non-volatile media, removable and non-removable media.

System memory 28 can include computer system readable media in the formof volatile memory, such as random access memory (RAM) 30 and/or cachememory 32. Computer system/server 12 may further include otherremovable/non-removable, volatile/non-volatile computer system storagemedia. By way of example only, storage system 34 can be provided forreading from and writing to a non-removable, non-volatile magnetic media(not shown and typically called a “hard drive”). Although not shown, amagnetic disk drive for reading from and writing to a removable,non-volatile magnetic disk (e.g., a “floppy disk”), and an optical diskdrive for reading from or writing to a removable, non-volatile opticaldisk such as a CD-ROM, DVD-ROM or other optical media can be provided.In such instances, each can be connected to bus 18 by one or more datamedia interfaces. As will be further depicted and described below,memory 28 may include at least one program product having a set (e.g.,at least one) of program modules that are configured to carry out thefunctions of embodiments of the invention.

Program/utility 40, having a set (at least one) of program modules 42,may be stored in memory 28 by way of example, and not limitation, aswell as an operating system, one or more application programs, otherprogram modules, and program data. Each of the operating system, one ormore application programs, other program modules, and program data orsome combination thereof, may include an implementation of a networkingenvironment. Program modules 42 generally carry out the functions and/ormethodologies of embodiments of the invention as described herein.

Computer system/server 12 may also communicate with one or more externaldevices 14 such as a keyboard, a pointing device, a display 24, etc.;one or more devices that enable a user to interact with computersystem/server 12; and/or any devices (e.g., network card, modem, etc.)that enable computer system/server 12 to communicate with one or moreother computing devices. Such communication can occur via Input/Output(I/O) interfaces 22. Still yet, computer system/server 12 cancommunicate with one or more networks such as a local area network(LAN), a general wide area network (WAN), and/or a public network (e.g.,the Internet) via network adapter 20. As depicted, network adapter 20communicates with the other components of computer system/server 12 viabus 18. It should be understood that although not shown, other hardwareand/or software components could be used in conjunction with computersystem/server 12. Examples include, but are not limited to: microcode,device drivers, redundant processing units, external disk drive arrays,RAID systems, tape drives, and data archival storage systems, etc.

Referring now to FIG. 8, illustrative cloud computing environment 50 isdepicted. As shown, cloud computing environment 50 comprises one or morecloud computing nodes 10 with which local computing devices used bycloud consumers, such as, for example, personal digital assistant (PDA)or cellular telephone 54A, desktop computer 54B, laptop computer 54C,and/or automobile computer system 54N may communicate. Nodes 10 maycommunicate with one another. They may be grouped (not shown) physicallyor virtually, in one or more networks, such as Private, Community,Public, or Hybrid clouds as described hereinabove, or a combinationthereof. This allows cloud computing environment 50 to offerinfrastructure, platforms and/or software as services for which a cloudconsumer does not need to maintain resources on a local computingdevice. It is understood that the types of computing devices 54A-N shownin FIG. 4 are intended to be illustrative only and that computing nodes10 and cloud computing environment 50 can communicate with any type ofcomputerized device over any type of network and/or network addressableconnection (e.g., using a web browser).

Referring now to FIG. 9, a set of functional abstraction layers providedby cloud computing environment 50 (FIG. 8) is shown. It should beunderstood in advance that the components, layers, and functions shownin FIG. 9 are intended to be illustrative only and embodiments of theinvention are not limited thereto. As depicted, the following layers andcorresponding functions are provided:

Hardware and software layer 60 includes hardware and softwarecomponents. Examples of hardware components include mainframes 61; RISC(Reduced Instruction Set Computer) architecture based servers 62;servers 63; blade servers 64; storage devices 65; and networks andnetworking components 66. In some embodiments, software componentsinclude network application server software 67 and database software 68.

Virtualization layer 70 provides an abstraction layer from which thefollowing examples of virtual entities may be provided: virtual servers71; virtual storage 72; virtual networks 73, including virtual privatenetworks; virtual applications and operating systems 74; and virtualclients 75.

In one example, management layer 80 may provide the functions describedbelow, which may include maintaining VPD at a VPD location the computersystem. Resource provisioning 81 provides dynamic procurement ofcomputing resources and other resources that are utilized to performtasks within the cloud computing environment. Metering and Pricing 82provide cost tracking as resources are utilized within the cloudcomputing environment, and billing or invoicing for consumption of theseresources. In one example, these resources may comprise applicationsoftware licenses. Security provides identity verification for cloudconsumers and tasks, as well as protection for data and other resources.User portal 83 provides access to the cloud computing environment forconsumers and system administrators. Service level management 84provides cloud computing resource allocation and management such thatrequired service levels are met. Service Level Agreement (SLA) planningand fulfillment 85 provide pre-arrangement for, and procurement of,cloud computing resources for which a future requirement is anticipatedin accordance with an SLA.

Workloads layer 90 provides examples of functionality for which thecloud computing environment may be utilized. Examples of workloads andfunctions which may be provided from this layer include: mapping andnavigation 91; software development and lifecycle management 92; virtualclassroom education delivery 93; data analytics processing 94;transaction processing 95; and providing state information to a serviceprocessor in an environment during a checkpointing process 96.

Aspects of the present invention are described herein with reference toflowchart illustrations and/or block diagrams of methods, apparatus(systems), and computer program products according to embodiments of theinvention. It will be understood that each block of the flowchartillustrations and/or block diagrams, and combinations of blocks in theflowchart illustrations and/or block diagrams, can be implemented bycomputer readable program instructions.

These computer readable program instructions may be provided to aprocessor of a general purpose computer, special purpose computer, orother programmable data processing apparatus to produce a machine, suchthat the instructions, which execute via the processor of the computeror other programmable data processing apparatus, create means forimplementing the functions/acts specified in the flowchart and/or blockdiagram block or blocks. These computer readable program instructionsmay also be stored in a computer readable storage medium that can directa computer, a programmable data processing apparatus, and/or otherdevices to function in a particular manner, such that the computerreadable storage medium having instructions stored therein comprises anarticle of manufacture including instructions which implement aspects ofthe function/act specified in the flowchart and/or block diagram blockor blocks.

The computer readable program instructions may also be loaded onto acomputer, other programmable data processing apparatus, or other deviceto cause a series of operational steps to be performed on the computer,other programmable apparatus or other device to produce a computerimplemented process, such that the instructions which execute on thecomputer, other programmable apparatus, or other device implement thefunctions/acts specified in the flowchart and/or block diagram block orblocks.

The flowchart and block diagrams in the Figures illustrate thearchitecture, functionality, and operation of possible implementationsof systems, methods, and computer program products according to variousembodiments of the present invention. In this regard, each block in theflowchart or block diagrams may represent a module, segment, or portionof instructions, which comprises one or more executable instructions forimplementing the specified logical function(s). In some alternativeimplementations, the functions noted in the block may occur out of theorder noted in the figures. For example, two blocks shown in successionmay, in fact, be executed substantially concurrently, or the blocks maysometimes be executed in the reverse order, depending upon thefunctionality involved. It will also be noted that each block of theblock diagrams and/or flowchart illustration, and combinations of blocksin the block diagrams and/or flowchart illustration, can be implementedby special purpose hardware-based systems that perform the specifiedfunctions or acts or carry out combinations of special purpose hardwareand computer instructions.

In addition to the above, one or more aspects may be provided, offered,deployed, managed, serviced, etc. by a service provider who offersmanagement of customer environments. For instance, the service providercan create, maintain, support, etc. computer code and/or a computerinfrastructure that performs one or more aspects for one or morecustomers. In return, the service provider may receive payment from thecustomer under a subscription and/or fee agreement, as examples.Additionally or alternatively, the service provider may receive paymentfrom the sale of advertising content to one or more third parties.

In one aspect, an application may be deployed for performing one or moreembodiments. As one example, the deploying of an application comprisesproviding computer infrastructure operable to perform one or moreembodiments.

As a further aspect, a computing infrastructure may be deployedcomprising integrating computer readable code into a computing system,in which the code in combination with the computing system is capable ofperforming one or more embodiments.

As yet a further aspect, a process for integrating computinginfrastructure comprising integrating computer readable code into acomputer system may be provided. The computer system comprises acomputer readable medium, in which the computer medium comprises one ormore embodiments. The code in combination with the computer system iscapable of performing one or more embodiments.

Although various embodiments are described above, these are onlyexamples. For example, computing environments of other architectures canbe used to incorporate and use one or more embodiments. Further,different instructions, instruction formats, instruction fields and/orinstruction values may be used. Many variations are possible.

Further, other types of computing environments can benefit and be used.As an example, a data processing system suitable for storing and/orexecuting program code is usable that includes at least two processorscoupled directly or indirectly to memory elements through a system bus.The memory elements include, for instance, local memory employed duringactual execution of the program code, bulk storage, and cache memorywhich provide temporary storage of at least some program code in orderto reduce the number of times code must be retrieved from bulk storageduring execution.

Input/Output or I/O devices (including, but not limited to, keyboards,displays, pointing devices, DASD, tape, CDs, DVDs, thumb drives andother memory media, etc.) can be coupled to the system either directlyor through intervening I/O controllers. Network adapters may also becoupled to the system to enable the data processing system to becomecoupled to other data processing systems or remote printers or storagedevices through intervening private or public networks. Modems, cablemodems, and Ethernet cards are just a few of the available types ofnetwork adapters.

The terminology used herein is for the purpose of describing particularembodiments only and is not intended to be limiting. As used herein, thesingular forms “a”, “an” and “the” are intended to include the pluralforms as well, unless the context clearly indicates otherwise. It willbe further understood that the terms “comprises” and/or “comprising”,when used in this specification, specify the presence of statedfeatures, integers, steps, operations, elements, and/or components, butdo not preclude the presence or addition of one or more other features,integers, steps, operations, elements, components and/or groups thereof.

The corresponding structures, materials, acts, and equivalents of allmeans or step plus function elements in the claims below, if any, areintended to include any structure, material, or act for performing thefunction in combination with other claimed elements as specificallyclaimed. The description of one or more embodiments has been presentedfor purposes of illustration and description, but is not intended to beexhaustive or limited to in the form disclosed. Many modifications andvariations will be apparent to those of ordinary skill in the art. Theembodiment was chosen and described in order to best explain variousaspects and the practical application, and to enable others of ordinaryskill in the art to understand various embodiments with variousmodifications as are suited to the particular use contemplated.

What is claimed is:
 1. A computer-implemented method, comprising:obtaining, by a processing circuit in a first service processor in adistributed computing environment, a location of a second serviceprocessor, wherein the second service processor is communicativelycoupled to the first service processor; stopping, by the processingcircuit, a virtual machine during runtime, wherein the virtual machineis executed by a hypervisor running on the first service processor, andwherein during runtime, the virtual machine executes one or moreprocesses to service and manage computing resources in the distributedcomputing environment, wherein the stopping comprises stoppinginstruction execution and stopping IO operations in the first serviceprocessor, and wherein based on the stopping the instruction executionand the IO operations are suspended; generating, by the processingcircuit, a micro-checkpoint of the virtual machine by invoking a codepath of the one or more processes to identify and copy dirty memory intoa local staging area on the first service processor; determining, by theprocessing circuit, that copying the dirty memory into the local stagingarea on the first service processor is complete; based on determiningthat the copying is complete resuming, by processing circuit,instruction execution in the first service processor by the virtualmachine and continuing suspension of the IO operations in the firstservice processor; transmitting, by processing circuit, themicro-checkpoint to a second service processor based on the location,wherein a processing circuit of the second service processor utilizesthe micro-checkpoint to enable a hypervisor on the second serviceprocessor to start a virtual machine on the second service processor;determining, by the processing circuit, that the transmitting iscomplete; and based on determining that the transmitting is complete,resuming, by the processing circuit, IO operations in the first serviceprocessor.
 2. The computer-implemented of claim 1, wherein theprocessing circuit of the second service processor utilizes themicro-checkpoint to enable the hypervisor on the second serviceprocessor to start the virtual machine on the second service processor,based on obtaining, by the processing circuit of the second serviceprocessor, an indication of a failure of the first service processor. 3.The computer-implemented of claim 1, further comprising: transmitting,by the processing circuit of the first service processor, to the secondservice processor, a first periodic signal at a regular interval; andobtaining, by the processing circuit, from the second service processor,a second periodic signal at the regular interval.
 4. Thecomputer-implemented of claim 3, wherein the transmitting themicro-checkpoint further comprises: verifying, by the processingcircuit, that the processing circuit obtained the second periodic signalfrom the second service processor during the regular interval within apre-defined time limitation.
 5. The computer-implemented method of claim1, further comprising: synchronizing, by the processing circuit of thefirst service processor, the first service processor to the secondservice processor by exchanging information with the second serviceprocessor utilizing a handshaking mechanism.
 6. The computer-implementedmethod of claim 5, the information comprising at least one of: registryinformation, virtual product data, or shared data.
 7. Thecomputer-implemented method of claim 1, further comprising: obtaining,by the processing circuit of the first service processor, an indicationthat the virtual machine of the second service processor is executingthe one or more processes; and based on obtaining the indication,enabling, by the processing circuit, an interdependent lock on the firstservice processor.
 8. The computer-implemented method of claim 7,further comprising: obtaining, by the processing circuit of the firstservice processor, a micro-checkpoint from the second service processorand retaining the micro-checkpoint; obtaining, by the processingcircuit, an indication of a failure of the second service processor;based on the indication, loading, by the processing circuit, themicro-checkpoint into a memory communicatively coupled to the processingcircuit; and utilizing, by the processing circuit, the micro-checkpointto re-start the virtual machine on the first service processor.
 9. Thecomputer-implemented method of claim 1, wherein the first serviceprocessor and the second service processor are physically separateservice processors.
 10. The computer-implemented of claim 1, wherein thestopping of the virtual machine is after one of: a specified number ofmilliseconds or a dynamic number of milliseconds.
 11. Thecomputer-implemented of claim 1, wherein the micro-checkpoint comprisesa memory state of the first service processor and a state of a structurethat holds nonvolatile data in the first service processor.
 12. Thecomputer-implemented of claim 1, wherein the transmitting furthercomprises communicating a state of input/output processes executed inthe virtual machine executing on the first service processor.
 13. Thecomputer-implemented of claim 1, further comprising: prior to obtainingthe location, starting, by the processing circuit of the first serviceprocessor, the virtual machine on the first service processor, whereinthe starting comprises: loading, by the processing circuit, instructionsfor loading and running an operating system on the first serviceprocessor; starting, by the processing circuit, a kernel that runs theoperating system; executing, by the operating system, instructions tostart the hypervisor; and starting, by the hypervisor, the virtualmachine on the first service processor, wherein the virtual machine runsa virtual operating system.
 14. A computer program product comprising: acomputer readable storage medium readable by one or more processingcircuits and storing instructions for execution by the one or moreprocessing circuits for performing a method comprising: obtaining, by aprocessing circuit in a first service processor in a distributedcomputing environment, a location of a second service processor, whereinthe second service processor is communicatively coupled to the firstservice processor; stopping, by the processing circuit, a virtualmachine during runtime, wherein the virtual machine is executed by ahypervisor running on the first service processor, and wherein duringruntime, the virtual machine executes one or more processes to serviceand manage computing resources in the distributed computing environment,wherein the stopping comprises stopping instruction execution andstopping IO operations in the first service processor, and wherein basedon the stopping the instruction execution and the IO operations aresuspended; generating, by the processing circuit, a micro-checkpoint ofthe virtual machine by invoking a code path of the one or more processesto identify and copy dirty memory into a local staging area on the firstservice processor; determining, by the processing circuit, that copyingthe dirty memory into the local staging area on the first serviceprocessor is complete; based on determining that the copying is completeresuming, by processing circuit, instruction execution in the firstservice processor by the virtual machine and continuing suspension ofthe IO operations in the first service processor; transmitting, byprocessing circuit, the micro-checkpoint to a second service processorbased on the location, wherein a processing circuit of the secondservice processor utilizes the micro-checkpoint to enable a hypervisoron the second service processor to start a virtual machine on the secondservice processor; determining, by the processing circuit, that thetransmitting is complete; and based on determining that the transmittingis complete, resuming, by the processing circuit, IO operations in thefirst service processor.
 15. The computer program product of claim 14,wherein the processing circuit of the second service processor utilizesthe micro-checkpoint to enable the hypervisor on the second serviceprocessor to start the virtual machine on the second service processor,based on obtaining, by the processing circuit of the second serviceprocessor, an indication of a failure of the first service processor.16. The computer program product of claim 14, the method furthercomprising: transmitting, by the processing circuit of the first serviceprocessor, to the second service processor, a first periodic signal at aregular interval; and obtaining, by the processing circuit, from thesecond service processor, a second periodic signal at the regularinterval.
 17. The computer program product of claim 16, wherein thetransmitting the micro-checkpoint further comprises: verifying, by theprocessing circuit, that the processing circuit obtained the secondperiodic signal from the second service processor during the regularinterval within a pre-defined time limitation.
 18. The computer programproduct of claim 14, the method further comprising: synchronizing, bythe processing circuit of the first service processor, the first serviceprocessor to the second service processor by exchanging information withthe second service processor utilizing a handshaking mechanism.
 19. Thecomputer program product of claim 14, the method further comprising:obtaining, by the processing circuit of the first service processor, anindication that the virtual machine of the second service processor isexecuting the one or more processes; and based on obtaining theindication, enabling, by the processing circuit, an interdependent lockon the first service processor.
 20. A system comprising: a memory; oneor more processing circuits in communication with the memory; andprogram instructions executable by the one or more processing circuitsvia the memory to perform a method, the method comprising: obtaining, bya processing circuit in a first service processor in a distributedcomputing environment, a location of a second service processor, whereinthe second service processor is communicatively coupled to the firstservice processor; stopping, by the processing circuit, a virtualmachine during runtime, wherein the virtual machine is executed by ahypervisor running on the first service processor, and wherein duringruntime, the virtual machine executes one or more processes to serviceand manage computing resources in the distributed computing environment,wherein the stopping comprises stopping instruction execution andstopping IO operations in the first service processor, and wherein basedon the stopping the instruction execution and the IO operations aresuspended; generating, by the processing circuit, a micro-checkpoint ofthe virtual machine by invoking a code path of the one or more processesto identify and copy dirty memory into a local staging area on the firstservice processor; determining, by the processing circuit, that copyingthe dirty memory into the local staging area on the first serviceprocessor is complete; based on determining that the copying is completeresuming, by processing circuit, instruction execution in the firstservice processor by the virtual machine and continuing suspension ofthe IO operations in the first service processor; transmitting, byprocessing circuit, the micro-checkpoint to a second service processorbased on the location, wherein a processing circuit of the secondservice processor utilizes the micro-checkpoint to enable a hypervisoron the second service processor to start a virtual machine on the secondservice processor; determining, by the processing circuit, that thetransmitting is complete; and based on determining that the transmittingis complete, resuming, by the processing circuit, IO operations in thefirst service processor.